Facilitated diffusion of proteins on chromatin 



O. Benichou, 1 C. Chevalier, 1 B.Meyer, 1 and R. Voituriez 1 

1 Laboratoire de Physique Theorique de la Matiere Condensee CNRS-UPMC, 4 Place Jussieu, 75255 Paris Cedex 

(Dated: January 27, 2011) 

We present a theoretical model of facilitated diffusion of proteins in the cell nucleus. This model, which takes 
into account the successive binding and unbinding events of proteins to DNA, relies on a fractal description of 
the chromatin which has been recently evidenced experimentally. Facilitated diffusion is shown quantitatively to 
be favorable for a fast localization of a target locus by a transcription factor, and even to enable the minimization 
of the search time by tuning the affinity of the transcription factor with DNA. This study shows the robustness 
of the facilitated diffusion mechanism, invoked so far only for linear conformations of DNA. 

PACS numbers: 



The now well established theory of facilitated diffusion ex- 
plains how DNA-binding proteins can in principle find their 
target sites on DNA efficiently. This model describes search 
trajectories as alternating phases of free diffusion in the bulk 
cytoplasm and 1 -dimensional diffusion along the DNA strand, 
called sliding, which is made possible by sequence indepen- 
dent interactions of proteins with DNA. Since the seminal 
work HI, such pathways have been evidenced experimentally 
both in vivo GJ and in vitro B]-[5) thanks to single molecule 
technics, and theoretical aspects have been refined (SHU, in 
particular highlighting that such strategies can minimize the 
search time for a target site by a proper tuning of the pro- 
tein/DNA interaction ifTOifBl . 

All these theoretical approaches rely on a schematic de- 
scription of DNA as a 1 -dimensional linear chain along which 
a protein can diffuse, surrounded by an homogeneous medium 
in which the protein performs regular diffusion. More re- 
cently, crowding effects have been incorporated in these mod- 
els both for the sliding lfl4ll and the bulk cytoplasmic phases 
[4|, leading to more realistic descriptions of gene regulation 
kinetics in prokaryots. 

However, such models are clearly inapplicable to eukary- 
ots, in which the DNA is packed in the cell nucleus lfl5l 
and forms a complex structure called chromatin which is far 
from a simple 1 -dimensional chain. Even if a full bottom-up 
description of the in vivo DNA organization remains out of 
reach, theoretical ideas 1161 . and now growing experimental 
evidences indicate that the chromatin has a hierarchized archi- 
tecture which displays fractal properties at least over the 100 
nm - 10 yt/m range. Indeed, textural image analysis, neutron 
scattering ifTTl . rheology technics ifTHl and more recently the 
Hi-C method [ 19 1 revealed independently a fractal structure of 
the chromatin characterized by a fractal dimension df which 
was found in the range 2.2 - 3. Despite this complex structure 
of the chromatin, the switching dynamics of proteins between 
a DNA bound state and a freely diffusing state, which charac- 
terizes facilitated diffusion in prokaryots, seems to be also at 
work in the nucleus, as evidenced on the examples of histons, 
high-mobility group proteins, and more generally chromatin 
binding proteins {20). This naturally raises the questions of 
determining whether the classical facilitated diffusion mecha- 
nism can be efficient also in the complex nuclear environment, 
and whether it can be used to regulate and optimize gene ex- 
pression in eukaryots. This letter presents a first theoretical 



model which quantitatively addresses these two questions. 




FIG. 1: Facilitated diffusion on chromatin: A chromatin binding pro- 
tein (blue) searches for a target locus (yellow) on chromatin (red). 
The binding (resp. unbinding) rate is denoted by Ai (resp. Ai). 

At the theoretical level, modelling facilitated diffusion in 
the cell nucleus raises two problems: (i) first, to take into ac- 
count the switching dynamics of the protein between a state 
bound to the chromatin and a freely diffusing state in the nu- 
cleoplasm, and (ii) second to model the diffusion phase of a 
protein bound to a complex structure such as chromatin. Point 
(i) has been studied in the context of intermittent search strate- 
gies Ell , and general methods to calculate mean search times 
for intermittent trajectories have been developed. On the other 
hand, the full distribution of the first-passage time (FPT) for 
diffusion in fractals has recently been obtained in ll22l and 
enables to tackle point (ii) under the assumption, backed by 
experiments lfT7HT9ll , that the chromatin is fractal. 

In this letter, we gather and extend these new tools to de- 
velop a theoretical model of facilitated diffusion in the cell 
nucleus. More precisely, we calculate analytically the mean 
search time for a target for a protein which alternates diffu- 
sion phases on the chromatin which is assumed to have a frac- 
tal structure, and free diffusion phases in the nucleoplasm (see 
Fig. [TJ. Under these hypothesis, we show quantitatively that 
facilitated diffusion in eukaryots can significantly speed up the 
search process, and that it enables to minimize the search time 
by tuning the affinity of the protein with DNA. These results 
are qualitatively similar to the case of prokaryots, and suggest 
that facilitated diffusion is a robust mechanism. At the theo- 
retical level, this study yields as a by-product the calculation 
of the distribution of the FPT averaged over the starting point 
for a particle diffusing in a fractal structure, which remained a 
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challenge in the field 

Search time distribution for simple diffusion in a fractal 
medium. We first consider a protein which remains in an ad- 
sorbed state and diffuses on the chromatin, which is modelled 
by a discretized domain D of volume N, and, following ex- 
perimental observations lfl7l4T9ll . is characterized by a fractal 
dimension dt and a typical size R oc N l/df . We calculate here 
the distribution of the search time, defined as the FPT at the 
target, averaged over the starting position of the protein. This 
first technical step is necessary to address the problem of facil- 
itated diffusion discussed in the next paragraph; besides, it is 
an important theoretical question. The protein of position r(f), 
is assumed to perform a symmetric nearest neighbor random 
walk on the chromatin with a constant hopping rate (set to 1). 
To account for the complex organization of DNA-DNA con- 
tact points the chromatin cannot be described as a linear chain: 
it is effectively branched even if the DNA is linear, yielding a 
connectivity potentially larger than 2 which takes into account 
intersegmental transfer. The resulting dynamics is character- 
ized by the walk dimension d w defined through the scaling 
of the mean square displacement with time: ^r 2 (f)^ °c t 2 ^ d ", 
and the nuclear membrane which bounds the chromatin is as- 
sumed to act as reflecting walls. As we proceed to show, the 
search time distribution is independent of these microscopic 
details of the chromatin conformation, and is governed only 
by its larger scale properties which are characterized by df 
and d w . 

We denote by Wj,(t) the propagator, i.e. the probability that 
the protein, starting at site i at t = 0, is at site j at time t, and 
write Wj M for the stationary probability at site j. We will 
make use of the pseudo-Green function of the walk defined 
={) (Wji(t) - Wf l )dt, and the Laplace transform of 

a generic function f(t) will be denoted by f{s). We are here 
interested in the global FPT (GFPT) at a given target site T, 
which is the FPT at site T averaged over the starting site S 
with weight Wf*. We thus define the probability density Oj- 
of the GFPT by ® T (f) = Z%i Wf P T j(t), where P Tj is the 
probability density of the FPT at T starting from a given site 
J- 

General expressions have been derived for the first moment 
of both the FPT (27) and the GFPT (28] (see also Il23ti26l 
for specific examples), and more recently, the higher FPT mo- 
ments have been determined in the large-volume limit N » 1 
11221 . In the case of non-compact exploration (d w < df), it 
reads 
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where (t t ) = H TT /W^ is the mean GFPT (281 . Using next 
that Hji WJ*" = Hy Wy, deduced from detailed balance, and 

averaging over S , we obtain rr£) = nl (tt) u , from which it 
can be deduced immediately that the GFPT distribution is a 
simple exponential of mean (t t ). 

In the compact case (d w > df) however, it can be shown 
that due to a stronger dependence of (t^^ on S Il22ll27l . the 
average over S must be taken before the large N limit, which 
makes the asymptotic form of |22| unusable for this purpose. 



Alternatively, one can make use of the renewal equation 
which reads in Laplace space Pts(s) = Wts(s)/Wtt(s)- Us- 
ing the symmetry relation W ;7 (f) W- m = Wij(t) WT, the aver- 
age over S can be taken and yields an exact expression of 
the GFPT distribution : <b T (s) = /(sW TT (s))- Taking next 
the large volume limit, the propagator Wtt(s) can be evalu- 
ated using the O'Shaughnessy and Procaccia formalism 11301 , 
which leads to 



3>rO) 



4 \ v F(l+v) I v (^A(t t )s) 



A(t t )s) T(l-v) I_ y ^A(t t )s) 



(2) 



where v = df/d w , A = 2(1 - v 2 )/v, and I v (and later J v ) denote 
Bessel functions. Introducing the rescaled variable 9 = t/(Tr), 
one finally obtains by inverse-Laplace transforming Q : 
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where the a^'s are the real zeros of J 
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FIG. 2: Distribution of the GFPT for diffusion in fractal medium in 
the compact case. Numerical simulations for different system sizes 
N (symbols) are plotted against the theoretical prediction of Eq. [3] 
(plain lines), a Sierpinsky gasket (target at the apex) and b critical 
bond percolation cluster embedded in a 3D cubic lattice (average 
taken over random targets and cluster realizations). 

This very general result, confirmed by numerical simula- 
tions on various fractal sets (see fig. |2]and [31 1) shows that the 
GFPT distribution takes a universal form indexed by df and 
d w only for any fractal structure, independently of its micro- 
scopic details. In particular, it shows the applicability of our 
approach to chromatin under the assumption that it is fractal at 
least at sufficiently large scale. As illustrative examples, sim- 
ulations were performed on structures such as critical perco- 
lation clusters, which capture the large scale properties of nu- 
clear DNA organization : they are (i) fractals characterized by 
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well defined d w and df, (ii) naturally embedded in euclidean 
space and (iii) disordered. These fractals can therefore be seen 
as minimal models of chromatin beyond the linear chain de- 
scription. 

Search time distribution for facilitated diffusion in a fractal 
medium. Following the classical picture of facilitated diffu- 
sion, we now consider that the protein can desorb from the 
chromatin with rate A\, and then freely diffuse in the nucleo- 
plasm before rebinding to the chromatin (see Fig. [IJ. In this 
first approach, we adopt a mean field treatment of the phases 
of free diffusion : we assume that the duration of such a phase 
is exponentially distributed with mean t 2 = I/A2, and fur- 
ther suppose that the protein rebinds at a position which is 
uniformly distributed on the chromatin. We determine in this 
paragraph the mean time necessary for the protein starting at 
a random position on the chromatin to reach a target locus 
on the chromatin for the first time. We denote by (T7-) this 
mean GFPT for facilitated diffusion, and by Fjif) the GFPT 
probability density. Using tools developed in the context of 
intermittent search strategies ITTTI |2D , it can be shown that 
the Laplace transform Fj is expressed in terms of the distri- 
bution <bj of the GFPT for simple diffusion on the chromatin, 
which is given by Eq.Q : 
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Assuming that diffusion on the chromatin is compact (as in 
most examples of fractals embedded in 3D space [32]) an us- 
ing expression ^ for O7- finally yields an explicit expression 
of the Laplace transformed distribution of the GFPT: 



F T (s) = (s + Ai)(s + A 2 )I v (x s ) 
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where x s = ~\/A{t t )(s + A t ). One can check that the classical 
result of facilitated diffusion on a 1 -dimensional DNA IHT1I2T1 
is recovered for df = 1 and d w = 2. The mean GFPT (T7-) is 
then readily obtained by writing 
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FIG. 3: Mean search time for facilitated diffusion on a Sierpinski 
gasket of generation 5 (N = 366) with a target at the apex. Numerical 
simulations (symbols) and theoretical prediction (Eq. [7] plain lines) 
of the mean GFPT , rescaled by its value for Ai = 0, is plotted as a 
function of xq = y/A (t t )Ai for various A 2 . 
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FIG. 4: Mean search time for facilitated diffusion on critical bond 
percolation clusters embedded in a 3D cubic lattice with a randomly 
located target. Numerical simulations (symbols) and theoretical pre- 
diction (Eq. [7] plain lines) of the mean GFPT , rescaled by its value 
for A[ = 0, is plotted as a function of x = \jA (r T )A[ for various A 2 
and system size N. 
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which finally yields the central result of this paper: 
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with xq = x s= q. This exact expression of the mean search 
time for facilitated diffusion in a fractal medium, validated by 
numerical simulations on various examples of fractals which 
mimic the large scale properties of chromatin (see Figs [3] [4] 
and ||3TI ) can be simplified as follows in the large limit : 
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Before commenting on this result, note that a limit distribution 
Gt of the GFPT can be obtained by considering the rescaled 
time = t/(TT)oo- Denoting U = s(Tr)co the Laplace vari- 
able associated to 0, and writing Gj(U) - FjiU/iTj)^, 
we find using ^ in the large volume limit Gt(U) ~ (1 + 
U)~ l , which yields immediately the simple exponential form 
G r (0) ~ exp(-0). 

Optimal search time. We now comment on previous results 
and focus on the minimization of the mean search time |7]) 
for facilitated diffusion. First note that (Tt-)^ oc in the 
large volume limit, which shows immediately that facilitated 
diffusion if faster than diffusion alone, which would yield a 
search time scaling as N d "^ df [28 1. Actually, the search time 
can be minimized as a function of the desorption rate A\, as 
soon as the adsorption rate A 2 is large enough. Quantitatively, 
the function (T7-) exhibits a minimum value for some Ai - A'™" 
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if the value of the derivative of (T7) with respect to A\ at Ai = 
is negative. This sets the following condition on A2 : 



A 2 > A'^" 
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which is in practice satisfied for a large enough chromatin vol- 
ume. Under this condition, a direct calculation shows that the 
optimal value Af° can be expanded in the large volume limit 

as: 
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One recovers in particular for df = 1 and d w = 2 the cele- 
brated result Af* ^ Ai IfTOlfTTH . 

Finally, these results show that facilitated diffusion is a ro- 
bust mechanism which can speed up the search for a target 



site even in the case of eukaryots, under the assumption that 
the chromatin has a fractal organization, which seems veri- 
fied experimentally. We add that this approach is indepen- 
dent of the microscopic structure of the DNA and could also 
be valid to some extent in the case of prokaryots, where the 
DNA, even if less densely packed than in eukaryots seems 
to have a rather compact organization significantly departing 
from a linear chain. Using typical experimental values d w ^ 3 
and df ^ 2.5, one finds that the search time is minimized 
for A™ in /A2 — 1/5. This suggests that the adsorption time 
should be significantly larger than the time of free diffusion to 
minimize the search, in contrast with the classical prediction 
ifTUl fTTl . This result is qualitatively compatible with experi- 
mental findings [2] on prokaryots, even if the fractal properties 
of the DNA structure needs to be determined in this case. 
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